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Qh! Abstract 

Exploiting the similarities between the Helmholtz wave equation 
and the Klein-Gordon equation, the former is linearized using the 
Feschbach-Villars procedure used for linearizing the Klein-Gordon equa- 
tion. Then the Foldy-Wouthuysen iterative diagonalization technique 
. is applied to obtain a Hamiltonian description for a system with vary- 

^^^^ , ing refractive index. Besides reproducing all the traditional quasi- 

' paraxial terms, this method leads to additional terms, which are de- 

pendent on the wavelength, in the optical Hamiltonian. This alter- 
O I nate prescription to obtain the aberration expansion is applied to the 

' axially symmetric graded index fiber. This results in the wavelength- 

dependent modification of the paraxial behaviour and the aberration 
coefficients. Explicit expression for the modified coefficients of the 
aberration to third-order are presented. Sixth and eighth order Hamil- 
tonians are derived for this system. 
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1 Introduction 



The traditional scalar wave theory of optics (including aberrations to all 
orders) is based on the beam-optical Hamiltonian derived using the Fermat's 
principle. This approach is purely geometrical and works adequately in the 
scalar regime. The other approach is based on the Helmholtz equation which 
is derived from the Maxwell equations; Then one makes the square-root of 
the Helmholtz operator followed by an expansion of the radical [1, 2]. This 
approach works to all orders and the resulting expansion is no different from 
the one obtained using the geometrical approach of the Fermat's principle. 

Another way of obtaining the aberration expansion is based on the al- 
gebraic similarities between the Helmholtz equation and the Klein-Gordon 
equation. Exploiting this algebraic similarity the Helmholtz equation is lin- 
earized in a procedure very similar to the one due to Feschbach-Villars, for 
linearizing the Klein-Gordon equation. This brings the Helmholtz equation 
to a Dirac-like form and then follows the procedure of the Foldy-Wouthuysen 
expansion used in the Dirac electron theory. This approach, which uses the 
algebraic machinery of quantum mechanics, was developed recently [3], pro- 
viding an alternative to the traditional square-root procedure. This scalar 
formalism gives rise to wavelength-dependent contributions modifying the 
aberration coefficients. The algebraic machinery of this formalism is very 
similar to the one used in the quantum theory of charged-particle beam optics, 
based on the Dirac [4]- [6] and the Klein-Gordon [7] equations respectively. 
The detailed account for both of these is available in [8]. A treatment of 
beam optics taking into account the anomalous magnetic moment is avail- 
able in [9]- [12]. 

General expressions for the Hamiltonians are derived without assuming 
any specific form for the refractive index. These Hamiltonians are shown to 
contain the extra wavelength-dependent contributions which arise very nat- 
urally in our approach. We apply the general formalism to the specific exam- 
ples: A. Medium with Constant Refractive Index. This example is essentially 
for illustrating some of the details of the machinery used. The Feschbach- 
Villars technique for linearizing the Klein- Gordon equation is summarized in 
Appendix- A. The Foldy-Wouthuysen transformation technique is outlined in 
Appendix-B. 

The other application, B. Axially Symmetric Graded Index Medium is 
used to demonstrate the power of the formalism. The traditional approaches 
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give six aberrations. Our formalism modifies tliese six aberration coefficients 
by wavelength-dependent contributions. 

The traditional beam-optics is completely obtained from our approach in 
the limit wavelength, A — > 0, which we call as the traditional limit of our 
formalism. This is analogous to the classical limit obtained by taking h — > 
in the quantum prescriptions. The scheme of using the Foldy-Wouthuysen 
machinery in this formalism is very similar to the one used in the quantum 
theory of charged-partide beam optics [4]-[12]. There too one recovers the 
classical prescriptions in the limit Aq — > where Aq = h/po is the de Broglie 
wavelength and po is the design momentum of the system under study. 



2 Traditional Prescriptions 

Recalling, that in the traditional scalar wave theory for treating monochro- 
matic quasiparaxial light beam propagating along the positive 2;-axis, the z- 
evolution of the optical wave function ■0(r) is taken to obey the Schrodinger- 
like equation 

iA^^(r) = H^{r) , (1) 
where the optical Hamiltonian H is formally given by the radical 

H^-{n\r)-piy^\ (2) 

and the refractive index, n{r) = n{x,y,z). In beam optics the rays are 
assumed to propagate almost parallel to the optic-axis, chosen to be z- 
axis, here. That is, <^ pz ^ I and \n{r) — no\ -C hq. The refractive 
index is the order of unity. Let us further assume that the refractive in- 
dex varies smoothly around the constant background value Uq without any 
abrupt jumps or discontinuities. For a medium with uniform refractive index, 
n{r) — no and the Taylor expansion of the radical is 
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In the above expansion one retains terms to any desired degree of accuracy 
in powers of In general the refractive index is not a constant and 

varies. The variation of the refractive index n{r), is expressed as a Taylor 
expansion in the spatial variables x, y with 2;-dependent coefficients. To get 
the beam optical Hamiltonian one makes the expansion of the radical as 
before, and retains terms to the desired order of accuracy in along 
with all the other terms (coming from the expansion of the refractive index 
n(r)) in the phase-space components up to the same order. In this expansion 
procedure the problem is partitioned into paraxial behaviour + aberrations, 
order-by-order. 

In relativistic quantum mechanics too, one has the problem of under- 
standing the behaviour in terms of nonrclativistic limit + relativistic correc- 
tions, order-by-order. In the Dirac theory of the electron this is done most 
conveniently through the Foldy-Wouthuysen transformation. 

Here, we follow a procedure similar to the one used for linearizing the 
Klein-Gordon equation via the Feshbach-Villars linearizing procedure [13]. 
The resulting Feshbach-Villars-like form has an algebraic structure very sim- 
ilar to the Dirac equation. This enables us to make an expansion using the 
Foldy-Wouthuysen transformation technique well-known in the Dirac elec- 
tron theory [14, 15]. The resulting expansion reproduces the above expansion 
in (3) as it should. Furthermore it gives rise to a set of wavelength-dependent 
contributions. The formalism presented here is an elaboration of the recent 
work which provides an alternative to the traditional square-root technique 
of obtaining the optical Hamiltonian [3] . 

Let us start with the wave-equation in the rectilinear coordinate system. 

Let 

^/ = i/j{r)e-'^\ a;>0, (5) 

then 

lv' + ^Ai^ir)^0. (6) 
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At this stage we introduce the wavization, 

d 

-i^Vx — >p^, — ' ' ^'^^ 

where A is the reduced wavelength given hj ^ = A/27r, c = ^uj and n{r) = 
c/v{r). It is to be noted that pq — qp = —iX. This is similar to the com- 
mutation relation in quantum mechanics. In our formalism 1K plays the same 
role which is played by the Planck constant, h in quantum mechanics. The 
traditional beam-optics formalism is completely obtained from our formalism 
in the limit A — > 0. Then, we get. 

Next, we linearize Eq. (8) following a procedure similar to, the one which 
gives the Feshbach-Villars [13] form of the Klein-Gordon equation. To this 
end, let 

- ( ) ■ 

Then, Eq. (8) is equivalent to 

~'^o^z ( tX] )^{^^ {n\r)-pl) n 1 ( I'Li ) (10) 
Next, we make the transformation. 









(11) 



where 



1 M 



M = M-i = ^(^ ^ _^ J , detM = -l. (12) 
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It is to be noted that the transformation matrix M is independent of z. For 
a monochromatic quasiparaxial beam (in forward direction), with leading 
2;-dependence '^{r) ~ exp {in(r)z/^}. Then 




(13) 



Since, \n{r) — no| ^ tt-oj we have ip^ ^ ■?/'_. 
Consequently, Eq. (8) can be written as 





<5 = z^iS'i + H 



— n' 



— n 



'{r))}{iay), 



(14) 



where ay and are, respectively, the y and 2; components of the triplet of 
Pauli matrices. 



It is to be noted that the even-part and odd-part in Hamiltonian (14) differ 
only by a Pauli matrix. This simplifies the commutations a lot as we shall 
see, shortly. The details of the Feshbach-Villars linearizing procedure for the 
Klein-Gordon equation are available in Appendix-A. 
The square of the Hamiltonian is 



as expected. Thus we have have taken the square-root is a different way. 
This has certain distinct advantages over the traditional procedure of directly 
taking the square-root. 



a — 





(16) 
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The purpose of casting Eq. (8) in the form of Eq. (14) will be obvious 
now, when we compare the latter with the form of the Dirac equation 




Hd = moC^(3 + SD + dD 
£d = q(p 
Od — coc ■ n , 



(17) 



where u and I stand for the upper and lower components respectively and 



a 
a 



1 
-1 



1 = 



1 
1 



(18) 



To proceed further, we note the striking similarities between Eq. (14) and 
Eq. (17). In the nonrelativistic positive energy case, the upper components 
\E'u are large compared to the lower components The odd (O) part of 
{Hd — moC^P), anticommuting with (3 couples the large \E'u to \E'; while the 
even (S) part commuting with (3, does not couple them. Using this fact, 
the well known Foldy-Wouthuysen formalism of the Dirac electron theory 
(see, e.g., [15]) employs a series of transformations on Eq. (17) to reach a 
representation in which the Hamiltonian is a sum of the nonrelativistic part 
and a series of relativistic correction terms; |c7f |/moc^ serves as the expansion 
parameter and the nonrelativistic part corresponds to an approximation of 
order up to \c7r\/moC^. The terms of higher order in |c7r|/moC^ constitute 
the relativistic corrections. Examining Eq. (14) we conclude ip^ ^ ip^, 
and the odd operator O, anticommuting with a^, couples the large ip+ with 
the small while the even operator S does not make such a coupling. 
This spontaneously suggests that a Foldy-Wouthuysen- ^zA;e technique can be 
used to transform Eq. (14) into a representation in which the corresponding 
beam optical Hamiltonian is a series with expansion parameter \p±\/tio. The 
correspondence between the beam optical Hamiltonian (14) and the Dirac 
electron theory is summarized in the following table: 
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The Analogy 



Standard Dirac Equation Beam Optical Form 

moc^p + 80 + 00 -noa^ + 8 + 6 

moc^ —no 

Positive Energy Forward Propagation 

Nonrelativistic, |7r| <S rriQC Paraxial Beam, <S uq 

Non relativistic Motion Paraxial Behavior 

+ Relativistic Corrections + Aberration Corrections 

Apphcation of the Foldy-Wouthuysen-/iA;e technique to Eq. (14) involves 
a series of transformations on it and after the required number of transfor- 
mations, depending on the degree of accuracy, Eq. (14) is transformed into 
a form in which the residual odd part can be neglected and hence the upper 
and lower components (■^^ and if) J) are effectively decoupled. In this repre- 
sentation the larger component (■0+) corresponds to the beam moving in the 
-l-z-direction and the smaller component (■0_) corresponds to the backward 
moving component of the beam. 

Using the correspondence between Eq. (14) and Eq. (17) the Foldy- 
Wouthuysen expansion given formally in terms of £ and O leads to the 
Hamiltonian 

Vp) = -noaz + £-^azd\ (19) 

2no 

To simplify the formal Hamiltonian we use, O'^ — + {uq — n^(r))| 

and recall that ^ = 2^ \p\ + {n^ — n^(r))| (Jx. Dropping the the formal 
Hamiltonian in in (19) is expressed in terms of the phase-space variables as: 



n^'^ = -n, + ^^{pl + {nl-n\r))} 

+ ^^{pl + {nl-n\r))}\ (20) 



The details of the Foldy-Wouthuysen iterative procedure are described in de- 
tain in Appendix-B. The lowest order Hamiltonian obtained in this procedure 
agrees with the traditional approaches. 
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To go beyond the expansions in (20) one goes a step further in the Foldy- 
Wouthuysen iterative procedure. To next-to-leading order the Hamiltonian 
is formally given by 



7^(4) 



1 



= -nocr^ + S - - — (TzO 
2no 



1 



8n2 



OA o,£ 



oz , 



8n^ 



^4 



oz I 



(21) 



As before we drop the cr^ and the resulting Hamiltonian in the phase-space 
variable is 



))} 



L{pi + („s-„v))}^ 



%r4 



32n^ 



32n^ 
1 



+ 



^^^,{pi+(n^-n^(.))r 
[pl+{nl-n\r))\ 



(22) 



The Hamiltonian thus derived has all the terms which one gets in the tra- 
ditional square-root approaches. In addition we also get the wavelength- 
dependent contributions. 

The details of the various transforms and the beam optical formalism 
being discussed here turns out to be a simplified analog of the more general 
formalism recently developed for the quantum theory of charged-partide beam 
optics [4]- [12], both in the scalar and the spinor cases, respectively. A very 
detailed description of these transforms and techniques is available in [8] . 
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Now, we can compare the above Hamiltonians with the conventional 
Hamiltonian given by the square-root approach [2]. The square-root ap- 
proach does not give all the terms, such as the one involving the commutator 
of with ^ (n^(r)). Our procedure of linearization and expansion in pow- 
ers of |p_|_|/no gives all the terms which one gets by the square-root expansion 
of (3) and some additional terms, which are the wavelength-dependent terms. 
Such, wavelength-dependent terms can in no way be obtained by any of the 
conventional prescriptions, starting with the Helmholtz equation (6). 

3 Applications 

In the previous sections we presented an alternative to the square-root ex- 
pansion and and we obtained an expansion for the beam-optical Hamilto- 
nian which works to all orders. Formal expressions were obtained for the 
paraxial Hamiltonian and the leading order aberrating Hamiltonian, with- 
out assuming any form for the refractive index. Even at the paraxial level 
the wavelength-dependent effects manifest by the presence of a commutator 
term, which does not vanish for a varying refractive index. 

Now, we apply the formalism to specific examples. First one is the 
medium with constant refractive index. This is perhaps the only problem 
which can be solved exactly in a closed form expression. This is just to illus- 
trate how the aberration expansion in our formalism can be summed to give 
the familiar exact result. 

The next example is that of the axially symmetric graded index medium. 
This example enables us to demonstrate the power of the formalism, repro- 
ducing the familiar results from the traditional approaches and further giving 
rise to new results, dependent on the wavelength. 

3.1 Medium with Constant Refractive Index 

For a medium with constant refractive index, n{r) — ric, we have. 




(23) 
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The Hamiltonian in (23) can be exactly diagonalized by the following trans- 
form, 

= exp [i (±1(7;,) O^' 
= exp [^a^Ve] 

= cosh iVe) T cr^ sinh {VO) . (24) 



We choose, 



tanh {2Ve) = —=-— = ) " < 1 > (25) 



then we obtain, 

pjdiagonal T~ 

= T+{-noa, + Va, + V{iay)}T- 

= -a^{nl~- 2noVY 

= -a,{nl-piy^ (26) 

We next, compare the exact result thus obtained with the approximate one, 
obtained through the systematic series procedure we have developed. We 
define P = ^ {p^ + (^2 -nl)}. Then, 

- -{nl-plY 

pjdiagonal (27) 

Knowing the Hamiltonian, we can compute the transfer maps. The trans- 
fer operator between any pair of points {{z",z') \z" > z'} on the z-axis, is 
formally given by 

\^{z",z')\=f{z",z')\^{z",z')) , (28) 
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with 



z') = nfiz", z') , f{z", z')^i, 



dzH^z) 



T{z",z') = p^exp 

II 

= X - ^ / dzn{z) 

A J z' 



+ ... , 



(29) 



where X is the identity operator and p denotes the path-ordered exponential. 
There is no closed form expression for T[z" ,z') for an arbitrary choice of the 
refractive index n{r). In such a situation the most convenient form of the 
expression for the 2;-evolution operator T{z" , z'), or the 2;-propagator, is 



f{z\ z') = exp 



-^T{z",z') 



(30) 



with 



T{z",z') = / dzn{;i 

J z' 



+- 



dz / dz' \n{z),H{z') 



+ ... , 



(31) 



as given by the Magnus formula [16] which is described in Appendix-C. We 
shall be needing these expressions in the next example where the refractive 
index is not a constant. 

Using the procedure outlined above we compute the transfer operator, 



Uc (^out , ^in) = exp 



^zHc 



ipi 1 (p\V 



exp +'^nAz{l-^-^--\^ 
A I 2ni o \ni 

^Z -^-out -^in ) 



(32) 
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Using (32), we compute the transfer maps 




out 








(33) 



The beam-optical Hamihonian is intrinsically aberrating. Even for simplest 
situation of a constant refractive index, we have aberrations to all orders! 



3.2 Axially Symmetric Graded Index Medium 

The refractive index of an axially symmetric gradcd-index material can be 
most generally described by the following polynomial (see, pp. 117 in [1]) 

n{r) = no + a2{z)r']_ + a4{z)r]_ + a6{z)r\ + as{z)r]_ H , (34) 

where, we have assumed the axis of symmetry to coincide with the optic-axis, 
namely the 2;-axis without any loss of generality. To write the beam-optical 
Hamiltonians we introduce the following notation 

wi{z) = —{2noa2{z)} 

W2{z) = \^al{z) + 2noa4{z)^ 

^3(-2) = {2noae{z) + 2a2{z)a4{z)} 
az 

W4{z) = — ^al{z) + 2a2{z)a6{z) + 2noas{z)^ (35) 
We also use, [A, B]_^ — (AB + BA). The beam-optical Hamiltonian is 
Ti- — -f^o,p + -f^0,(4) + -f^o,(6) + -f^o,(8) 

^^0,(2) + -"0,(4) + -"0,(6) + -"0,(8) 

Ho,p = -no + tt^pI - a2{z)rl 

ZUq 

^0.(4) = ^pl-^{plrl + rlpl)-a4iz)ri 
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0,(6) 



Ho, (8) 



H, 



a) 

0,(2) 

a) 



H, 



0,(4) 
(A) 



0,(6) 
(^) 



0,(8) 



8nl 









p1, 


Plrl 


+_ 



5a2(2;) 

(3al(z) — 4:noa4{z)^ 

- (2al{z) + inoa^iz)) plrlpij 
4 (q;2(^) + noa2{z)a4{z) + nlaQ{ 

- 5al{z) 



«2 



^2 2 



+ {2al{z) + 4noQ;2(2;)Q;4(x 



2 2 ^2 2 



-a8{z)r[ 



16no [ dz 



— (noa2(^))|T 
^2(2) (rlf + frl 



wl{z)r] 



3^' 



32nt 



w,{z) (rir + frl) + {wl{z) + 2w^{z)ws{z)} rl (36) 



Sni 



32n^ 



The reason for partitioning Ti. in the above manner will be clear as we proceed. 
The paraxial transfer maps are formally given by 



{P±) 



out 



P Q 
R S 



{P±) 



(37) 



where P, Q, R and 5" are the solutions of the paraxial Hamiltonian in (36). 
The transfer operator is most accurately expressed in terms of the the 
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paraxial solutions, P, Q, R and S, via the interaction picture [17]. 

T {z , zo) = exp 
= exp 



--T {z,zo) 



+K {z" , z') [pI , (p^ ■ + ■ p^) 

(2;" , /) (p_, ■ r-x + ■ p^)' 
+F (pM + ripi) 

+£> , [rl , (p^ • + • p^)" 

+E {z" , z') r\ 



(38) 



The six aberration coefficients are given by, 



K {z" , ^' 



4ng 
^2 



QS{PS + gi?) - a^{z)PQ' 



W2{z) (Q\PS + gi?) + 2PQ^S 



A iz" , z' 



32r4 



2n^ 



-PQRS - ai{z)P'Q' 



16n^ 



W2(;2) {PQ{PS + QR)) 



wl{z)P'Q' 



32n^ 



d^; < — 5- 
1 8n^ 



^2 _ '^'^i^) / r>2 o2 I /o2 D2^ „, ^'„^D2/-->2 



4n^ 



-iP'S' + Q'R')-a^{z)P'Q' 
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^2 

W2(z) {PQ(PS + QR)) 



D{z",z') = r dz\^E?S -'^^PR{PS+QR)-ai{z)P''Q 

J z' I OTln 477,r| 





—W2{z) {P\PS + QR) + 2p2gi?) 



= r dz\^R'-^^P'R'-a4z)P' 

^2 



W2 



(z) (p'r) 



«^?(^)^'} • (39) 



32n^ 



Thus wc see that the transfer operator and the aberration coefficients are 
modified by ^-dependent contributions. 

The sixth and eighth order Hamiltonians are modified by the presence 
of wavelength-dependent terms. These will in turn modify the the fifth and 
seventh order aberrations respectively [18]- [21]. 



4 Concluding Remarks 

We exploited the similarities between the Helmholtz equation and the Klein- 
Gordon equation to obtain an alternate prescription for the aberration expan- 
sion. In this prescription we followed a procedure due to Feschbach-Villars 
for linearizing the Klein-Gordon equation. After casting the Helmholtz equa- 
tion to this linear form, it was further possible to use the Foldy-Wouthuysen 
transformation technique of the Dirac electron theory. This enabled us to 
obtain the beam-optical Hamiltonian to any desired degree of accuracy. We 
further get the wavelength-dependent contributions to at each order, start- 
ing with the lowest-order paraxial paraxial Hamiltonian. Formal expressions 
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were obtained for the paraxial and leading order aberrating Hamiltonians, 
without making any assumption on the form of the refractive index. 

As an example we considered the medium with a constant refractive index. 
This is perhaps the only problem which can be solved exactly, in a closed 
form expression. This example was primarily for illustrating certain aspects 
of the machinery we have used. 

The second, and the more interesting example is that of the axially 
symmetric graded index medium. For this system we derived the beam- 
optical Hamiltonians to eighth order. At each order we find the wavelength- 
dependent contributions. The fourth order Hamiltonian was used to ob- 
tain the six, third order aberrations coefficients which get modified by the 
wavelength-dependent contributions. Explicit relations for these coefficients 
were presented. In the limit — > 0, the alternate prescription here, repro- 
duces the very well known Lie Algebraic Formalism of Light Optics. It would 
be worthwhile to look for the extra wavelength-dependent contributions ex- 
perimentally. 

The close analogy between geometrical optics and charged-particle has 
been known for too long a time. Until recently it was possible to see this anal- 
ogy only between the geometrical optics and classical prescriptions of charge- 
particle optics. A quantum theory of charged-particle optics was presented 
in recent years [4]- [12]. With the current development of the non-traditional 
prescriptions of Helmholtz optics [3] and the matrix formulation of Maxwell 
optics, using the rich algebraic machinery of quantum mechanics, it is now 
possible to see a parallel of the analogy at each level. The non-traditional 
prescription of the Helmholtz optics is in close analogy with the quantum 
theory of charged-particles based on the Klein-Gordon equation. The matrix 
formulation of Maxwell optics presented here is in close analogy with the 
quantum theory of charged-particles based on the Dirac equation [22]. We 
shall examine the parallel of these analogies in Appendix-D and summarize 
the Hamiltonians in the various prescriptions in Table-A. 
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Appendix A. 
The Feshbach-Villars Form of the 
Klein-Gordon Equation 

The method we have followed to cast the time-independent Klein-Gordon 
equation into a beam optical form linear in suitable for a systematic study, 
through successive approximations, using the Foldy-Wouthuysen-like trans- 
formation technique borrowed from the Dirac theory, is similar to the way 
the time-dependent Klein-Gordon equation is transformed (Feshbach and Vil- 
lars, [13]) to the Schro-dinger form, containing only first-order time deriva- 
tive, in order to study its nonrelativistic limit using the Foldy-Wouthuysen 
technique (see, e.g., Bjorken and Drell, [15]). 

Defining 

the free particle Klein-Gordon equation is written as 

^V^_!^)*. (A.2) 
Introducing the linear combinations 

the Klein-Gordon equation is seen to be equivalent to a pair of of coupled 
differential equations: 

at 2mo 

i^— = (*+ + *_)- moc^*- . (A.4) 

ot zmo 

Equation (A.4) can be written in a two-component language as 
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with the Feshbach-Villars Hamiltonian for the free particle, Hq^ , given by 



FV 



2mo 



2mo 

— moc^ — 



2mo 



TTIqC + 



-CT. + i' 



-cr„ 



For a free nonrelativistic particle with kinetic energy <S moc^, it is seen that 
^+ is large compared to 

In presence of an electromagnetic field, the interaction is introduced 
through the minimal coupling 



P 



n — p — qA , 



qcj). 



(A.7) 



The corresponding Feshbach-Villars form of the Klein-Gordon equation be- 
comes 




= H 

1 
2 



FV 




V 



moc 

moc^az + £ + O 
;:2 



TT 



2mo 



TT 



2mo 



(A.8) 



As in the free-particle case, in the nonrelativistic situation ^^.i. is large com- 
pared to The even term £ does not couple ^+ and ^_ whereas O is odd 
which couples ^+ and Starting from (A.8), the nonrelativistic limit of 
the Klein-Gordon equation, with various correction terms, can be understood 
using the Foldy-Wouthuysen technique (see, e.y., Bjorken and Drell, [15]). 

It is clear from the above that we have just adopted the above technique 
for studying the z-evolution of the Klein-Gordon wavefunction of a charged- 
particlc beam in an optical system comprising a static electromagnetic field. 
The additional feature of our formalism is the extra approximation of drop- 
ping Gz in an intermediate stage to take into account the fact that we are 
interested only in the forward-propagating beam along the z-direction. 
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Appendix-B. 
Foldy-Wouthuysen Transformation 



In the traditional scheme the purpose of expanding the light optics Hamil- 
tonian H — — {n^ir) — ^ in a series using (^p-p^) ^-^^ expansion 
parameter is to understand the propagation of the quasiparaxial beam in 
terms of a series of approximations (paraxial + nonparaxial) . Similar is the 
situation in the case of the charged-particle optics. Let us recall that in rela- 
tivistic quantum mechanics too one has a similar problem of understanding 
the relativistic wave equations as the nonrelativistic approximation plus the 
relativistic correction terms in the quasirelativistic regime. For the Dirac 
equation (which is first order in time) this is done most conveniently using 
the Foldy-Wouthuysen transformation leading to an iterative diagonalization 
technique. 

The main framework of the formalism of optics, used here (and in the 
charged-particle optics) is based on the transformation technique of the 
Foldy-Wouthuysen theory which casts the Dirac equation in a form display- 
ing the different interaction terms between the Dirac particle and and an ap- 
plied electromagnetic field in a nonrelativistic and easily interpretable form 
(see, [14, 23], for a general discussion of the role of the Foldy-Wouthuysen- 
type transformations in particle interpretation of relativistic wave equations). 
In the Foldy-Wouthuysen theory the Dirac equation is decoupled through a 
canonical transformation into two two-component equations: one reduces to 
the Pauli equation in the nonrelativistic limit and the other describes the 
negative-energy states. 

Let us describe here briefiy the standard Foldy-Wouthuysen theory so that 
the way it has been adopted for the purposes of the above studies in optics 
will be clear. Let us consider a charged-particle of rest-mass mo, charge q in 
the presence of an electromagnetic field characterized hy E — —'Vcf) — 
and B — 'V X A. Then the Dirac equation is 

i^-*(r,i) = HD^{r,t) (B.l) 
Hd — mQ(?(5 -\- qcf) -\- cot ■ li 
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= moc^p + S + d 
O — ca ■ TT , 



(B.2) 



where 



a 



a 
a 
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-1 
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1 
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1 " 




" -i " 




' 1 
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1 





> = 


i 







-1 





(B.3) 



with Tz = p — qA, p = —ih'V, and tt^ = (tt^ + tt ^ + tt^) . 

In the nonrelativistic situation the upper pair of components of the Dirac 
Spinor ^ are large compared to the lower pair of components. The opera- 
tor £ which does not couple the large and small components of ^ is called 
'even' and O is called an 'odd' operator which couples the large to the small 
components. Note that 



(B.4) 
C7*, such that 



Now, the search is for a unitary transformation, ^' = ^ - 
the equation for ^' does not contain any odd operator. 

In the free particle case (with = and it — p) such a Foldy-Wouthuysen 
transformation is given by 



^is^^pa-pe^ tan2|p|e 



moc 



(B.5) 



This transformation eliminates the odd part completely from the free particle 
Dirac Hamiltonian reducing it to the diagonal form: 



4^' 



COS \pW H — sm 



CO. ■ p] e ""^ 

(moC^P + CO.- p^ 
Pa ■ p 



X cos \p\9 — 



\P\ 



sin \p\0 
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= (moc^ COS 2|p|6' + c|p| sin 2|p|6') /J^f' 



(B.6) 



In the general case, when the electron is in a time-dependent electromag- 
netic field it is not possible to construct an exp(i5') which removes the odd 
operators from the transformed Hamiltonian completely. Therefore, one has 
to be content with a nonrelativistic expansion of the transformed Hamilto- 
nian in a power series in 1/moC^ keeping through any desired order. Note 
that in the nonrelativistic case, when |p| ^ moc, the transformation oper- 
ator Up — exp(i5') with S ^ — i/3(5/2moc^, where O — ccx. ■ p is the odd 
part of the free Hamiltonian. So, in the general case we can start with the 
transformation 



^1 



2moC^ 



2moC 



(B.7) 



Then, the equation for ^^^^ is 

d 



^4 ' ' 



D 



d_ 
di 



(B.8) 



where we have used the identity ^ (^e"^j e + e^-^ (^e "^^ = ^7 = 0. 
Now, using the identities 



2! 



3!' 



d 



at 
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d 



2! 



3! 



2dA{t)] \ 



dm] 

dt J 



+Ait) 



dt j"\ 



dA{t) _ 1_ 
2! 



dt 
1 

"3! 
1 

"4! 



A{t), 
A{t), 



A{t), 
A{t), 



dA{t) 
dt 

dAjt) 
dt 

A{t), 



dA{t) 
dt 



with A — iSi, we find 



17(1) _ 



Hd - ri^:^ + 1 



dt 



Q ^ hdSi 

Ol , /2 n -; — 

' 2 dt 



1 
"2! 
i 

3! 



^1, 
^1, 



Si, 



(B.9) 



(B.IO) 



Substituting in (B.IO), Ho — moC^P + £ + O, simplifying the right hand 
side using the relations PO — —OP and p£ — Sp and collecting everything 
together, we have 



H 



(1) 



D 



niQC^p + £i + di 



S + 



1 



2moc^ 
1 



pO^' 



2^4 



po^ 
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/5 



2moc2 



(B.ll) 



with £i and Oi obeying the relations pOi = —Oi[3 and I3£i = £i(3 exactly 
like S and O. It is seen that while the term O in if^:) is of order zero with 
respect to the expansion parameter l/niQC^ {i.e., O = O (^{l/rrioc^)^^ the 

odd part of namely Oi, contains only terms of order 1/moC^ and higher 
powers of l/uioc^ {i.e., Oi = O ((1/moc^))). 

To reduce the strength of the odd terms further in the transformed Hamil- 
tonian a second Foldy-Wouthuysen transformation is applied with the same 
prescription: 



^(2) ^ e'^^¥^\ 

yi_ 

2moC^ 

After this transformation, 
d 



2moC^ 



dt J 3mlcf^ 



q3 



at 



D 



where, now, 02 — (^{1 / mQC^)^^ . After the third transformation 



(B.12) 



(B.13) 



^(3) ^ e'^sx^fi^)^ 



2moc2 



we have 



d 



ot 



£2 ~ ^1 , O3 



P 



02,82 +in 



d02 
dt 



(B.14) 



(B.15) 
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where 0^ = O {^{1 / mo(?)^^ . So, neglecting Oa, 



2moc^ 




(B.16) 



It may be noted that starting with the second transformation successive 
{£, O) pairs can be obtained recursively using the rule 



j>i, 



(B.17) 



and retaining only the relevant terms of desired order at each step. 

With £ = q4> and O = ca-TV, the final reduced Hamiltonian (B.16) is, to 
the order calculated. 



H 



(3) 
D 



(5 yniQC? + 



TT 



2mo SmgC^ 



S-curl^;- 



qh 
2moC 



(31: -B 



qh^ 



divE, 



(B.18) 



with the individual terms having direct physical interpretations. The terms 



in the first parenthesis result from the expansion of ymlc'^ + c^tt^ showing 
the effect of the relativistic mass increase. The second and third terms are 
the electrostatic and magnetic dipole energies. The next two terms, taken 
together (for hermiticity), contain the spin-orbit interaction. The last term, 
the so-called Darwin term, is attributed to the zitterbewegung (trembling 
motion) of the Dirac particle: because of the rapid coordinate fluctuations 
over distances of the order of the Compton wavelength (27r^/moc) the particle 
sees a somewhat smeared out electric potential. 

It is clear that the Foldy-Wouthuysen transformation technique expands 
the Dirac Hamiltonian as a power series in the parameter 1/moc^ enabling the 
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use of a systematic approximation procedure for studying the deviations from 
the nonrelativistic situation. We note the analogy between the nonrelativistic 
particle dynamics and paraxial optics: 



The Analogy 



Standeird Dirac Equation 



Beam Optical Form 

—rioaz + £ + 

-no 



Positive Energy 
Nonrelativistic, |7r| <^ rriQC 
Non relativistic Motion 



Forward Propagation 
Paraxial Beam, -C Uq 
Paraxial Behavior 



+ Relativistic Corrections 



+ Aberration Corrections 



Noting the above analogy, the idea of Foldy-Wouthuysen form of the Dirac 
theory has been adopted to study the paraxial optics and deviations from it 
by first casting the Maxwell equations in a spinor form resembling exactly the 
Dirac equation (B.l, B.2) in all respects: i.e., a multicomponent ^ having 
the upper half of its components large compared to the lower components 
and the Hamiltonian having an even part (£), an odd part (O), a suitable 
expansion parameter, {\Px_\/nQ <^ 1) characterizing the dominant forward 
propagation and a leading term with a (5 coefficient commuting with £ and 
anticommuting with O. The additional feature of our formalism is to return 
finally to the original representation after making an extra approximation, 
dropping j3 from the final reduced optical Hamiltonian, taking into account 
the fact that we are primarily interested only in the forward-propagating 
beam. 
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Appendix-C 
The Magnus Formula 

The Magnus formula is the continuous analogue of the famous Baker- 
Campbell-Hausdorff (BCH) formula 

gAgS ^ ^A+B^[A,B]+^{[[A,A],B]+[[A,B],B]}+... _ (Q j) 

Let it be required to solve the differential equation 

= A{t)u{t) (C.2) 

to get u{T) at T > to, given the value of u{tQ); the operator A can represent 
any linear operation. For an infinitesimal At, we can write 

u{to + At)^e^'^^'°^u{to). (C.3) 
Iterating this solution we have 

u{to + 2At) = e^*^(*°+^*)e^*^(*°)M(to) 

u{to + 3At) = QAtA(to+2At)^AtA(to+At)^AtA{to)^^^^^ 

and so on. (C.4) 

If T = to + NAt we would have 



u 



(T) = e^*^(*°+"^*) I u{to) . (C.5) 

I n=0 J 



Thus, m(T) is given by computing the product in (C.5) using successively 
the BCH-formula (C.l) and considering the limit At — > 0,N — > oo such 
that NAt = T — to- The resulting expression is the Magnus formula (Mag- 
nus, [16]) : 



u{T) = f{T,to)u{to) 
r{T,to) = exp|^^dtii(ti) 
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+ 



Z Jtn •/to 



rt2 



'to 

1 



to J to 



+^ / dh 1 dt2 / dh ([U(t3),i(t2)l 

D J to J to J to 

...A. 



(C.6) 



To see how the equation (C.6) is obtained let us substitute the assumed 
form of the solution, u{t) — T {t , to) u (to) , in (C.2). Then, it is seen that 
T{t,to) obeys the equation 



d_ 
di 



f {t, to) = A{t)T {t, to), f {to, to) = T. 



Introducing an iteration parameter A in (C.7), let 
d 



dt 



T(t,to;X) = XA(t)T(t,to;X), 



T(to,to;\) = I, T{t,to;l) = T{t,to). 
Assume a solution of (C.8) to be of the form 

r(i,io;A) =e"(*'*°'^) 

with 

oo 

n{t, to; X) = Y. >^"^n{t, to), Kito, to) = for all n . 



n=l 



Now, using the identity (see, Wilcox, [24]) 



one has 



(C.7) 

(C.8) 
(C.9) 

(C.IO) 
(C.ll) 



|.gQ(t,to;A) ^ J ds^snit,to;X)^QU t^. A)e-^^(*'*0'^) I e"(*'^) , (C.12) 
at [Jo at 



£ c^se^"(*'*°'^)— (](t, to; A)e-^"(*'*°'^) = XA(t) . (C.13) 

Substituting in (A13) the series expression for fl{t,to;X) (C.ll), expanding 
the left hand side using the first identity in (C8), integrating and equating 
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the coefficients of A-' on both sides, we get, recursively, the equations for 
Ai(t,to), A2(t,to),... , etc. For J = 1 



d_ 
di 



Ai{t,to) ^ A{t), Ai(to,to)=0 



and hence 

For j = 2 

d 



Ai(t,io) = f'dhA{t 

Jto 



(C.14) 
(C.15) 



^,A.(Mo) + 2 



d 



= 0, A2(io,io) = (C.16) 



and hence 
Similarly, 



^2{t,to) = ^ /*o?i2 /*' dh \A{t2) , A{t 



(C.17) 



6 Jto 



dh I dh 

to 



+ 



dh {[[A{h) , A{h)] , A{h) 
A{h),A{h)] ,A{h)]} . 



(C.18) 



Then, the Magnus formula in (C.6) follows from (C.9)-(C.ll). Equation 31 
we have, in the context of 2;-evolution follows from the above discussion with 
the identification t — > z, h — ^ z^^\ T — > z'^'^^ and A{t) — > —^7io{z). 

For more details on the exponential solutions of linear differential equa- 
tions, related operator techniques and applications to physical problems the 
reader is referred to Wilcox [24], Bellman and Vasudevan [25], Dattoli et 
al. [26], and references therein. 
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Appendix-D 
Analogies between light optics and 
charged-particle optics: Recent Developments 

Historically, variational principles have played a fundamental role in the 
evolution of mathematical models in classical physics, and many equations 
can be derived by using them. Here the relevant examples are Fermat's prin- 
ciple in optics and Maupertuis' principle in mechanics. The beginning of the 
analogy between geometrical optics and mechanics is usually attributed to 
Descartes (1637), but actually it can traced back to Ibn Al-Haitham Alhazen 
(0965-1037) [27]. The analogy between the trajectory of material particles in 
potential fields and the path of light rays in media with continuously variable 
refractive index was formalized by Hamilton in 1833. The Hamiltonian anal- 
ogy lead to the development of electron optics in 1920s, when Busch derived 
the focusing action and a lens-like action of the axially symmetric magnetic 
field using the methodology of geometrical optics. Around the same time 
Louis de Broglie associated his now famous wavelength to moving particles. 
Schrodinger extended the analogy by passing from geometrical optics to wave 
optics through his wave equation incorporating the de Broglie wavelength. 
This analogy played a fundamental role in the early development of quan- 
tum mechanics. The analogy, on the other hand, lead to the development 
of practical electron optics and one of the early inventions was the electron 
microscope by Ernst Ruska. A detailed account of Hamilton's analogy is 
available in [28] -[29]. 

Until very recently, it was possible to see this analogy only between the 
geometrical-optic and classical prescriptions of electron optics. The reasons 
being that, the quantum theories of charged-particle beam optics have been 
under development only for about a decade [4]- [12] with the very expected 
feature of wavelength-dependent effects, which have no analogue in the tradi- 
tional descriptions of light beam optics. With the current development of the 
non-traditional prescriptions of Helmholtz optics [3] and the matrix formula- 
tion of Maxwell optics, accompanied with wavelength-dependent effects, it is 
seen that the analogy between the two systems persists. The non-traditional 
prescription of Helmholtz optics is in close analogy with the quantum theory 
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of charged-particle beam optics based on the Klein-Gordon equation. The 
matrix formulation of Maxwell optics is in close analogy with the quantum 
theory of charged-particle beam optics based on the Dirac equation. This 
analogy is summarized in the table of Hamiltonians. In this short note it is 
difficult to present the derivation of the various Hamiltonians from the quan- 
tum theory of charged-particle beam optics, which arc all available in the 
references. We shall briefly consider an outline of the quantum prescriptions 
and the non-traditional prescriptions respectively. A complete coverage to 
the new field of Quantum Aspects of Beam Physics (QABP), can be found 
in the proceedings of the series of meetings under the same name [30] . 

D.l Quantum Formalism of Charged-Particle Beam 
Optics 

The classical treatment of charged-particle beam optics has been extremely 
successful in the designing and working of numerous optical devices, from 
electron microscopes to very large particle accelerators. It is natural, however 
to look for a prescription based on the quantum theory, since any physical 
system is quantum mechanical at the fundamental level! Such a prescription 
is sure to explain the grand success of the classical theories and may also help 
get a deeper understanding and to lead to better designing of charged-particle 
beam devices. 

The starting point to obtain a quantum prescription of charged particle 
beam optics is to build a theory based on the basic equations (Schrodinger, 
Klein-Gordon, Dirac) of quantum mechanics appropriate to the situation 
under study. In order to analyze the evolution of the beam parameters of 
the various individual beam optical elements (quadrupolcs, bending mag- 
nets, ■ ■ ■) along the optic axis of the system, the first step is to start with 
the basic time-dependent equations of quantum mechanics and then obtain 
an equation of the form 

d 

i^^V' {x, y; s)^n {x, y; s) ip {x, y; s) , (D.l) 

where {x, y; s) constitute a curvilinear coordinate system, adapted to the 
geometry of the system. Eq. (D.l) is the basic equation in the quantum 
formalism, called as the beam-optical equation; Ti and ip as the beam-optical 
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Hamiltonian and the beam wavefunction respectively. The second step re- 
quires obtaining a relationship between any relevant observable {{0){s)} at 
the transverse-plane at s and the observable {(0)(sin)} at the transverse 
plane at Sin, where Sin is some input reference point. This is achieved by the 
integration of the beam-optical equation in (D.l) 

ilj{x,y;s) = [/ (s, Sin) V' (a;, y; Sin) , (D.2) 

which gives the required transfer maps 

(O) (sin) ^ (O) (s) = {ij{x,y;s)\0\^{x,y;s)) , 

yx, y, Sin )|c/^OC/U(x,y;sin)) . (D.3) 



The two-step algorithm stated above gives an over-simplified picture of 
the quantum formalism. There are several crucial points to be noted. The 
first-step in the algorithm of obtaining the beam-optical equation is not to 
be treated as a mere transformation which eliminates t in preference to a 
variable s along the optic axis. A clever set of transforms are required which 
not only eliminate the variable t in preference to s but also give us the s- 
dependent equation which has a close physical and mathematical analogy 
with the original t-dependent equation of standard time-dependent quantum 
mechanics. The imposition of this stringent requirement on the construc- 
tion of the beam-optical equation ensures the execution of the second-step of 
the algorithm. The beam-optical equation is such that all the required rich 
machinery of quantum mechanics becomes applicable to the computation of 
the transfer maps that characterize the optical system. This describes the 
essential scheme of obtaining the quantum formalism. The rest is mostly 
mathematical detail which is inbuilt in the powerful algebraic machinery 
of the algorithm, accompanied with some reasonable assumptions and ap- 
proximations dictated by the physical considerations. The nature of these 
approximations can be best summarized in the optical terminology as a sys- 
tematic procedure of expanding the beam optical Hamiltonian in a power 
series of \7r±/po\, where po is the design (or average) momentum of beam 
particles moving predominantly along the direction of the optic axis and n± 
is the small transverse kinetic momentum. The leading order approxima- 
tion along with |7r^/po| ^ 1, constitutes the paraxial or ideal behaviour and 
higher order terms in the expansion give rise to the nonlinear or aberrating 
behaviour. It is seen that the paraxial and aberrating behaviour get modified 
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by the quantum contributions which arc in powers of the de Broghe wave- 
length (Ao = h/po). The classical limit of the quantum formalism reproduces 
the well known Lie algebraic formalism [31] of charged-particle beam optics. 

D.2 Light Optics: Various Prescriptions 

The traditional scalar wave theory of optics (including aberrations to all or- 
ders) is based on the beam-optical Hamiltonian derived by using Fermat's 
principle. This approach is purely geometrical and works adequately in 
the scalar regime. The other approach is based on the square-root of the 
Helmholtz operator, which is derived from the Maxwell equations [31]. This 
approach works to all orders and the resulting expansion is no different from 
the one obtained using the geometrical approach of Fermat's principle. As for 
the polarization: a systematic procedure for the passage from scalar to vector 
wave optics to handle paraxial beam propagation problems, completely tak- 
ing into account the way in which the Maxwell equations couple the spatial 
variation and polarization of light waves, has been formulated by analyzing 
the basic Poincare invariance of the system, and this procedure has been 
successfully used to clarify several issues in Maxwell optics [32, 33, 34]. 

The two-step algorithm used in the construction of the quantum theories 
of charged-particle beam optics is very much applicable in light optics! But 
there arc some very significant conceptual differences to be born in mind. 
When going beyond Fermat's principle the whole of optics is completely 
governed by the Maxwell equations, and there are no other equations, unlike 
in quantum mechanics, where there are separate equations for, spin- 1/2, 
spin-1, • ■ •. 

Maxwell's equations arc linear (in time and space derivatives) but cou- 
pled in the fields. The decouphng leads to the Helmholtz equation which is 
quadratic in derivatives. In the specific context of beam optics, purely from 
a calculational point of view, the starting equations are the Helmholtz equa- 
tion governing scalar optics and for a more complete prescription one uses 
the full set of Maxwell equations, leading to vector optics. In the context of 
the two-step algorithm, the Helmholtz equation and the Maxwell equations 
in a matrix representation can be treated as the 'basic' equations, analogue 
of the basic equations of quantum mechanics. This works perfectly fine from 
a calculational point of view in the scheme of the algorithm we have. 

Exploiting the similarity between the Helmholtz wave equation and the 
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Klein-Gordon equation, the former is linearized using the Feshbach-Villars 
procedure used for the linearization of the Klein-Gordon equation. Then the 
Foldy-Wouthuysen iterative diagonalization technique is applied to obtain 
a Hamiltonian description for a system with varying refractive index. This 
technique is an alternative to the conventional method of series expansion of 
the radical. Besides reproducing all the traditional quasiparaxial terms, this 
method leads to additional terms, which are dependent on the wavelength, 
in the optical Hamiltonian. This is the non-traditional prescription of scalar 
optics. 

The Maxwell equations are cast into an exact matrix form taking into 
account the spatial and temporal variations of the permittivity and perme- 
ability. The derived representation using 8x8 matrices has a close algebraic 
analogy with the Dirac equation, enabling the use of the rich machinery 
of the Dirac electron theory. The beam optical Hamiltonian derived from 
this representation reproduces the Hamiltonians obtained in the traditional 
prescription along with wavelength-dependent matrix terms, which we have 
named as the polarization terms [35]. These polarization terms are alge- 
braically very similar to the spin terms in the Dirac electron theory and 
the spin-precession terms in the beam-optical version of the Thomas-BMT 
equation[9]. The matrix formulation provides a unified treatment of beam 
optics and light polarization. Some well known results of light polarization 
are obtained as a paraxial limit of the matrix formulation [32, 33, 34]. The 
traditional beam optics is completely obtained from our approach in the limit 
of small wavelength, ^ — > 0, which we call as the traditional limit of our for- 
malisms. This is analogous to the classical limit obtained by taking h — > 0, 
in the quantum prescriptions. 

From the Hamiltonians in the Table we make the following observations: 
The classical/traditional Hamiltonians of particle/light optics are modified 
by wavelength-dependent contributions in the quantum/non-traditional pre- 
scriptions respectively. The algebraic forms of these modifications in each 
row is very similar. This should not come as a big surprise. The starting 
equations have one-to-one algebraic correspondence: Helmholtz Klein- 
Gordon; Matrix form of Maxwell <-> Dirac equation. Lastly, the de Broglie 
wavelength, Aq, and A have an analogous status, and the classical/traditional 
limit is obtained by taking Aq — > and 1^ — > respectively. The parallel 
of the analogies between the two systems is sure to provide us with more 
insights. 
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Table A. 

Hamiltonians in Different Prescriptions 

The following are the Hamiltonians, in the different prescriptions of light beam op- 
tics and charged-particle beam optics for magnetic systems. Hq ,p are the paraxial 
Hamiltonians, with lowest order wavelength-dependent contributions. 



Light Beam Optics 


Charged-Particle Beam Optics 


Fermat's Principle 

n = - {n^r) - pI}'/' 


Maupertuis' Principle 

n = -{pl-nl}'/'-qA, 


Non- Traditional Helmholtz 


Klein- Gordon Formalism 


Ho,p = 


Hq^p = 

-Po- qA, + 2^7ri 


Maxwell, Matrix 


Dirac Formalism 


Ho,p = 
- iA/3S • u 


-ffo,p = 
-Po- qA^ + 

- ^ {ii-f-E^ .B^ + {q + ii) ^,B,} 

+ i^eB^ 
moc ^ 


Notation 

Refractive Index, n(r) = c^e{r)ii{r) 
Resistance, h{r) = ^^{r)/e{r) 

«W= 2n(r)'^^(^) 

XI and P are the Dirac matrices. 


Ha anomalous magnetic moment. 
€a anomalous electric moment. 
H = 2mo/Ua/^ , e = 2moea/h 
7 = E/moc^ 
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